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min, is expressed and secreted in yeast 
which has been mutated to lack the yeast 
aspartyl protease 3 (Yap3p) or its equiva- . 
lent, thereby reducing the production of a 
45kD albumin fragment. A further reduc- 
tion is achieved by additionally deleting the 
Kex2p function. Alternatively, a modified 
albumin is prepared which is not suscep- 
tible to Yap3p cleavage, for example hu- 
man albumin which is R410A, K413Q and 
K414Q. 
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YEAST STRAINS AWn Mnn m A T rttiv^s 



Field of f he invention 



5 



The present invention relates to the production of recombinant human albumin 
(rHA) by yeast species. 

Backgrou nd and p rint- ^ 



Human sen™ albumin (HSA) is a protein of 585 amino acids that is 
response for a significant proportion of me osmotic pressure of serum, and 
also functions as a carrier of endogenous and exogenous Hgands. I, is used 
ctacally in the treatment of patients with severe burns, shock, or Wood ioss 
and at present is produced commercial by extraction from human blood The 
15 production of recombinant human albumin (rHA) in microorganisms has been 
disclosed in EP 330 451 and EP 361 991. 

In recent years yes, species have been wide.y used as a host orgamsms for the 
production of heterologous proteins (reviewed by Romanos « al 1992) 
20 includingrHA(Sleep« a ,,,990,,991 ;H eer« aU 99.). Veasts are readi* 
amenable to gen«ic manipulation, can be grown ,„ high cell density on simple 
medta, and as eukaryotes are suitable for production of secreted as well as 
cytosohc proteins. 

25 When S. cer^ae is utilised to produce rHA. the major secreted protein is 
mature 67kDa a.bumin. However, a 45kDa N-termina, fragment of rHA is 
also observed (Sleep « al, 1990). A similar fragment is obtained when rHA 
n expressed in Klu^ercmyces sp. (Fleer e, al, 1991) ami Pichia panaris (EP 
510 693). The fragment has the same N-terminal amino acid sequence as 
mantre rHA, bu, the carboxy terminus is heterogeneous and occurs between 
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Phe 403 and Val 409 with the most common termini being Leu 407 and Val 409 
(Geisow et al, 1991), as shown below. 



-Phe-Gln-Asn-Ala-Uu^ 

4 °5 410 415 



The amount of fragment produced, as a percentage of total rHA secreted 
10 vanes with both the strain and the secretion leader sequence utilised, but is 
never reduced to zero (Sleep et al, 1990). We have also found that the amount 
of fragment produced in high cell density fermentation (75-100g/L cell dry 
weight) is approximately five times higher than in shake flask cultures. 

15 The 45kDa albumin fragment is not observed in serum-derived human serum 
albumin (HSA), and its presence as non-nature-identical material in the 
recombinant product is undesirable. The problem addressed by the present 
invention is to reduce the amount of the 45kDa fragment in the product The 
simplest and most obvious approach would have been to have purified it away 
20 from the full length albumin, as proposed by Gist-brocades in EP 524 681 (see 
especially page 4, lines 17-22). However, we have chosen a different 
approach, namely to try to avoid its production in the first place. 

Sleep * al (1990) postulated that rHA fragment is produced within the cell and 
25 not the result of extra-cellular proteolysis. These authors codon-optimised 
the HSA cDNA from Glu- to Ser«> but this had no effect on production of 
rHA fragment. They noted that a potential Kex2p processing site in the rHA 
amino acid sequence, Lys-Lys- is in close proximity to the heterogeneous 
carboxy terminus of the fragment, but neither use of a kex2 host strain (ie a 
30 strain harbouring a mutation in the KEX2 gene such that it does not produce the 
Kex2p protease), nor removal of the potential cleavage site by site-directed 
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mutagenesis of the codon for Lys 4 ' 4 , resulted in reduction in the amount of the 
fragment. 

There is a vast array of yeast proteases which could, in principle, be degrading 
5 a desired protein product, including (in S. cerevisiae) yscA, yscB, yscY, yscS, 
other vacuolar proteinases, yscD, yscE, yscF (equivalent to kex2p), ysca,' 
yscIV, yscG, yscH, yscJ, yscE and kexl. 

Bourbonnais et al (1991) described an S. cerevisiae endoprotease activity 
10 specific for monobasic sites, an example of which (Arg 4 '°) exists in this region 
of albumin. This activity was later found to be attributable to yeast aspartyl 
protease 3 (Yap3) (Bourbonnais et al, 1993), an enzyme which was originally 
described by Egel-Mitani et al (1990) as an endoprotease similar to Kex2p in 
specificity, in that it cleaved at paired basic residues. Further work suggested 
15 that Yap3p is able to cleave monobasic sites and between, and C-terminal to, 
pairs of basic residues, but that cleavage at both types of sites is dependent on 
the sequence context (Azaryan et al, 1993; Cawley et al, 1993). 

As already discussed, the region of the C-terminus of rHA fragment contains 
20 both a monobasic (Arg 4 ">) and a dibasic site (Lys 4I3 Lys 4 ' 4 ). However, even 
though a Kex2p-like proteolytic activity is present in human cells and is 
responsible for cleavage of the pro sequence of HSA C-terminal to a pair of 
arginine residues, the fragment discussed above is not known to be produced 
in humans. This indicates that the basic residues Arg 410 , Lys 413 and Lys 4 ' 4 are 
25 not recognised by this Kex2p-like protease, in turn suggesting that this region 
of the molecule may not be accessible to proteases in the secretory pathway. 
Thus, the Yap3p protease could not have been predicted to be responsible for 
the production of the 45kDa fragment. In addition, Egel-Mitani et al (1990 
Yeast 6, 127-137) had shown Yap3p to be similar to Kex2p in cleaving the 
30 MFcr propheromone. Since removal of the Kex2p function alone does not 
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reduce the amount of the fragment produced, there was no reason to suppose 
that removal of the Ya P 3 P function would be beneficial. Indeed, Bourbonnais 
et al (1993) showed that yap3 strains had a decreased ability to process pro- 
somatostatin, and therefore taught away from using yap3 strains in the 
production of heterologous proteins. 

Summary n f the invention 

The solution to the problem identified above is, in accordance with the 
invention, to avoid or at least reduce production of the fragment in the initial 
fermentation, rather than to remove it during purification of the albumin. We 
have now found that, out of the 20 or more yeast proteases which are so far 
known to exist, it is in fact the Yap3p protease which is largely responsible for 
the 45kD fragment of rHA produced in yeast. The present invention provides 
15 a method for substantially reducing the amount of a 45kDa fragment produced 
when rHA is secreted from yeast species. The reduction in the amount of 
fragment both improves recovery of rHA during the purification process, and 
provides a higher quality of final product. A further, and completely 
unexpected, benefit of using yap3 strains of yeast is that they can produce 30- 
20 50% more rHA than strains having the Yap3p function. This benefit cannot be 
accounted for merely by the reduction of rHA fragment from ~ 15% to 3-5%. 

Thus, one aspect of the present invention provides a process for preparing 
albumin by secretion from a yeast genetically modified to produce and secrete 
25 the albumin, comprising culturing the yeast in a culture medium such that 
albumin is secreted into the medium, characterised in that the yeast cells have 
a reduced level of yeast aspartyl protease 3 proteolytic activity. 

Preferably, the said proteolytic activity is an endoprotease activity specific for 
30 monobasic sites and for paired basic amino acids in a polypeptide. 
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5 

Suitably, the yeast is S. cerevisiae which lacks a functional YAPS gene. 
However, the invention is not limited to the use of S. cerevisiae, since the 
problem of 45 kDa fragment production is found also in other yeast genera, for 
example Pichia and Kluyveromyces, which shows that they have equivalent 
proteases (ie Yap3p proteolytic activity); see Clerc et al (1994), page 253. We 
have confirmed this by hybridisation analysis to locate homologues of Ya P 3p 
in non-Saccharomyces genera. A gene is regarded as a homologue, in general, 
if the sequence of the translation product has greater than 50% sequent 
identity to Yap3p. In non-Saccharomyces genera, the Yap3p-like protease and 
its gene may be named differently, but this does hot of course alter their 
essential nature. 



The level of fragment can be reduced still further if, as well as substantially 
eliminating the Ya P 3 P proteolytic activity, the Kex2p function is also 
15 substantially eliminated even though, as mentioned above, elimination of the 
Kex2p function alone does not affect the level of fragment. As in the case of 
Yap3p, the Kex2p function is not restricted to Saccharomyces; see Gellissen et 
al (1992), especially the sentence bridging pages 415 and 416, showing that 
Pichia has a Kex2p function. The genes encoding the Kex2p equivalent activity 
20 in Kluyveromyces lactis and Yarroma lipolytica have been cloned (Tanguy- 
Rougeau et al, 1988; Enderlin & Ogrydziak, 1994). 

A suitable means of eliminating the activity of a protease is to disrupt the host 
gene encoding the protease, thereby generating a non-reverting strain missing 

25 all or part of the gene for the protease (Rothstein, 1983). Alternatively, the 
activity can be reduced or eliminated by classical mutagenesis procedures or by 
the introduction of specific point mutations by the process of transplacement 
(Winston et al, 1983). Preferably, the activity of the enzyme is reduced to at 
most 50% of the wild-type level, more preferably no more than 25%, 10% or 

30 5%, and most preferably is undetectable. The level of Yap3p proteolytic 
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activity may be measured by defining t he production of the 45 kDa 
fragment, or by .he "'HJ.-lipoprotein assay of Azaryan e, at (1993), also used 
by Cawley « „ (1993). Kex2 P proteolytic activity may similarly be measured 
by known assays, for example as set out in Fuller a al (1989). 

The albumin may be a human albumin, or a variant thereof, or albumin from 

any other animal. 

By "variants- „ e iriclude insenions ^.^ ^ ^ 

10 conservativeornon-conservative, where such changes do not substantially alter 
to oncotic, useful ligand-binding or non-immunogenic properties of albumin 
In particular, we include naturally-occurring polymorphic variants of human 
albumm; fragments of human albumin which include the region cleaved by 
Yap3 P , for example those fragments disclosed in EP 322 094 (namely HSA (1 
15 n), where » is 369 to 419) which are sufficiently long ,„ ilKlud e the Yap3p- 
cleaved region (ie where „ is 403 to 4.9); and fusions of albumin (or Yap3„ 
citable portions thereof) with other proteins, for example the kind disclosed 
in WO 90/13653. 

20 By "conservative substitutions" is intended swaps within groups such as Gly 
Ala; Val, He, Uu; Asp, Glu; Asn, G.n; Ser, Thr; Lys, Arg; and Phe, Tyr. ' 

Such variants may be made using the methods of protein engineering and site- 
directed mutagenesis as described below 

25 

A second aspect of the invention provides a modified albumin having a. least 
90% sequence identity to a naturally-occurring albumin, which namrally- 
occurnng albumin is susceptible to cleavage with the S. cerevisiae yeas, 
asparty. protease 3 (Yap3p) when expressed in yeast, characterised in „a, the 
30 modified albumin is not susceptible to such cleavage. 
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Preferably, the modified albumin lacks a monobasic amino acid present in the 
naturally-occurring albumin protein. Suitably, the said monobasic amino acid 
is arginine. Conveniently, the modified albumin additionally lacks a pair of 
basic amino acids present in the naturally-occurring albumin, especially any of 
5 Lys, Lys; Lys, Arg; Arg, Lys; or Arg, Arg. Thus, in one particular 
embodiment, the naturally-occurring albumin is human albumin and the 
modified protein lacks Arg 410 and, optionally, one or both Lys 413 Lys 414 lysines. 
For example, the modified albumin may be human albumin having the amino 
acid changes R410A, K413Q, K414Q. Equivalent modifications in bovine 
10 serum albumin include replacing the Arg 408 and/or one or both of Arg 4I1 Lys 412 . 
The person skilled in the art will be able to identify monobasic sites and pairs 
of basic residues in other albumins without difficulty. 

The numbering of the residues corresponds to the sequence of normal mature 
15 human albumin. If the albumin is a variant (for example a polymorphic form) 
having a net deletion or addition of residues N-terminal to the position 
identified, then the numbering refers to the residues of the variant albumin 
which are aligned with the numbered positions of normal albumin when the two 
sequences are so aligned as to maximise the apparent homology. 

20 

A third aspect of the invention provides a polynucleotide encoding such a 
modified albumin. 

The DNA is expressed in a suitable yeast (either the DNA being for a modified 
25 albumin, or the yeast lacking the Yap3p function) to produce an albumin. 
Thus, the DNA encoding the albumin may be used in accordance with known 
techniques, appropriately modified in view of the teachings contained herein, 
to construct an expression vector, which is then used to transform an 
appropriate yeast cell for the expression and production of the albumin. 
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The DNA encoding the albumin may be joined to a wide variety of other DNA 
sequences for introduction into an appropriate host. The companion DNA will 
depend upon the nature of the host, the manner of the introduction of the DNA 
into the host, and whether episomal maintenance or integration is desired. 

5 

Generally, the DNA is inserted into an expression vector, such as a plasmid, 
in proper orientation and correct reading frame for expression. The vector is 
then introduced into the host through standard techniques and, generally, it will 
be necessary to select for transformed host cells. 

10 

Host cells that have been transformed by the recombinant DNA of the invention 
are then cultured for a sufficient time and under appropriate conditions known 
to those skilled in the art in view of the teachings disclosed herein to permit the 
expression and secretion of the albumin, which can then be recovered, as is 
15 known. 

Useful yeast plasmid vectors are pRS403-406 and pRS413-416 and are 
generally available from Stratagene Cloning Systems, La Jolla, CA 92037, 
USA. Plasmids pRS403, pRS404, pRS405 and pRS406 are Yeast Integrating 
20 plasmids (Yips) and incorporate the yeast selectable markers HIS3, TRP1, 
LEU2 and URA3. Plasmids pRS413-416 are Yeast Centromere plasmids 
(YCps). Other yeast expression plasmids are disclosed in EP-A-258 067, EP- 
A-286 424 and EP-A-424 117. 

25 The polynucleotide coding sequences encoding the modified albumin of the 
invention may have additional differences to those required to produce the 
modified albumin. For example, different codons can be substituted which 
code for the same amino acid(s) as the original codons. Alternatively, the 
substitute codons may code for a different amino acid that will not affect the 

30 activity or immunogenicity of the albumin or which may improve its activity 
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or tmmunogenicity, as well as reducing its susceptibility to a Yap3p protease 
acl.v,.y. For example, site-directed mutagenesis or other techniques can be 
employed to create single or muhiple rauationSi S1K „ „ replacements 
■nseruons, delete, and transpositions, as described in Botstein and Shortle 
5 (1985). Since such modified coding sequences can be obtained by the 
apphcation of known techniques to the teachings contained herein such 
modtfied coding sequences are within the scope of the claimed invention. 

Exemplary genera of yeast contemplated to be useful in the practice of the 
present invention are Pichia, SaccHaromyces, Kluyveronyces, Canity 
Torulopsx, Hansenula (now reclassified as PicHia), H i aoplasaa 
Sch^sau^ces. G,eromyces, Pachysolen, Debaromyces, Masdumiko™ 
RhodospoMum, Leucosporidium. Botryoascus, Sparidiobolus, Endomyccpsis 
and the like. Preferred genera are those selected from the group consisting of 
Ate. Saccharomyces, Kluyveromyces, Yarroma and Hwemla. Examples 
ofSacCarontyces „. are S. cereMae, S. UaUcus and S. rou.iL Examples of 
U^eronyce, sp. are K. JrasiUs and K. lacis. Examp.es of HanenuU, 
(P'chu,) sp. are H. pofymorpha (now Pichia angusta), H. anonu^ (now P 
anomala) ami p. pastoris. Y. lipolyiica is an example of a suitable Yarrowia 
£v species. 



25 



30 



Methods for the transformation of 5. c«*fa, are taugh, generally in EP 251 
744, EP 258 067 and WO 90/01063, all of which are incorporated herein by 
reference. Suitable promoters for J. ceremae include those associated with 
*e PGK1 gene, GAL1 or CAL10 genes, CYC], PHOS, TUP,, ADH1 ADH2 

.hegenesforglycera.dehyde-3-phospha.edehydrogenase.hexokinase, pyruvate 
decarboxylase, phosphofrucokinase, triose phosphate isomerase 
Phosphoglucose isomerase, g.ucokinase, o-mating factor pheromone, a-maUng 
factor pheromone, me PRB1 promoter, the GPD1 promoter, and hybrid 
promoter, involving hybrids of pans of 5' reg„| a , ory regi „ ns wi(h pans of ? , 
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5 



regulatory regions of other promoters or with upstream activation sites (eg the 
promoter of EP-A-258 067). 



e are 



Convenient regulable promoters for use in Schizosaccharomyces pomh 
the thiamine-repressible promoter from the runt gene as described by Maundrell 
(1990) and the glucose-repressible^/ gene promoter as described by Hoffman 
& Winston (1990). 



Methods of transforming Pichia for expression of foreign genes are taught in, 
10 for example, Cregg et al (1993), and various Phillips patents (eg US 4 857 
467, incorporated herein by reference), and Pichia expression kits are 
commercially available from Invitrogen BV, Leek, Netherlands, and Invitrogen 
Corp., San Diego, California. Suitable promoters include A OX1 and AOX2. 

15 The Gellissen et al (1992) paper mentioned above and Gleeson et al (1986) J. 
Gen. Microbiol. 132, 3459-3465 include information on Hansenula vectors and 
transformation, suitable promoters being MOX1 and FMD1; whilst EP 361 991, 
Fleer et al (1991) and other publications from Rhone-Poulenc Rorer teach how 
to express foreign proteins in Kluyveromyces spp., a suitable promoter being 

20 PGKL 



The transcription termination signal is preferably the 3' flanking sequence of 
a eukaryotic gene which contains proper signals for transcription termination 
and polyadenylation. Suitable 3' flanking sequences may, for example, be 
25 those of the gene naturally linked to the expression control sequence used, ie 
may correspond to the promoter. Alternatively, they may be different in which 
case the termination signal of the S. cerevisiae ADH1 gene is preferred. 

The albumin is initially expressed with a secretion leader sequence, which may 
30 be any leader effective in the yeast chosen! Leaders useful in S. cerevisiae 
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include that from the mating factor a polypeptide (MFol) and the hybrid 
leaders of EP-A-387 319. Such leaders (or signals) are cleaved by the yeast 
before the mature albumin is released into the surrounding medium. When the 
yeast strain lacks Kex2 P activity (or equivalent) as well as being yap3, it may 
5 be advantageous to choose a secretion leader which need not be cleaved from 
the albumin by Kex2p. Such leaders include those of S. cerevisiae invertase 
(SUC2) disclosed in JP 62-096086 (granted as 91/036516), acid phosphatase 
(PHOS), the pre-sequence of MFa-1, /3-glucanase (BGL2) and killer toxin; S. 
diastaticus glucoamylase II; S. carlsbergensis a-galactosidase (MEU); K. lactis 
10 killer toxin; and Candida glucoamylase. 

Various non-limiting embodiments of the invention will now be described by 
way of example and with reference to the accompanying drawings in which: 



15 



Figure 1 is a general scheme for the construction of mutated rHA expression 
Plasmids, in which HA is a human albumin coding sequence, L is a sequence 
encodmg a secretion leader, P is the PRB1 promoter, T is the ADH1 
termmator, amp is an ampicillin resistance gene and LEU2 is the leucine 
selectable marker; 



20 



Figure 2 is a drawing representing a Western blot analysis of mutant rHA 
secreted by S. cerevisiae, in which Track A represents the culture supernatant 
from DB1 cir» pAYE316 (normal rHA), Track B represents the culture 
supernatant from DB1 cir+ pAYE464 (alteration 1), and Track C represents the 
25 culture supernatant from DB1 cir + pAYE468 (alteration 3); 

Figure 3 is a scheme of the construction of pAYE515; 

Figure 4 is a comparison of rHA fragment production by wild-type and 
30 protease-disrupted strains, presented as a drawing of an anti-HSA Western blot 
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of culture supernatant from shake flask cultures separated by non-reducing 10% 
SDS/PAGE, in which Track A corresponds to DBl cir° P AYE316 Track B 
corresponds to DXY10 cir' pAYE316 (ya P 3 strain), and Track C corresponds 
to ABB50 cir° pAYE316 (yap3, kex2 strain); 

5 

Figure 5 is similar to Figure 4 but shows Coomassie Brilliant Blue stained 
12.5% SDS Phastgel (Pharmacia) of culture supernatants from fed batch 
fermentations, namely Track D for the HSA standard, Track E for DBl cir° 
PAYE316, Track F for DBl Akex2 cir° pAYE522, and Track G for DXY10 
10 cir° pAYE522; and 

Figure 6 is a scheme for the construction of pAYE519. 
Detailed iWripfi,^ p f thft i nv0ntinn 



15 



AH standard recombinant DNA procedures are as described in Sambrook et al 
(1989) unless otherwise stated. The DNA sequences encoding HSA are derived 
from the cDNA disclosed in EP 201 239. 

20 Example 1: Modification of the HSA cDNA. 

In order to investigate the role of endoproteases in the generation of rHA 
fragment, the HSA cDNA (SEQ1 (which includes a sequence encoding the 
artificial secretion leader sequence of WO 90/01063)) was modified by site- 
directed mutagenesis. Three separate changes were made to the HSA sequence 
(SEQ2). The first, using the mutagenic primer FOG1, changed the Arg<"> 
codon only, repla cing it with an Ala codon, leaving intact the dibasic site, 
Lys<"Lys<". The second change, using primer FOG2, changed the residues 
407-409, including the C-terminal residues of fragment, from LeuLeuVal to 
AlaValAla. The third change, using the primer FOG3, altered residues 410- 



25 



30 
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414 from ArgTyrThrLysLys (SEQ3) to AlaTyrThrGlnGIn (SEQ4). The 
oligonucleotides encoded not only the amino acid changes, but also conservative 
base changes that create either a Pvull or an Spel restriction site in the mutants 
to facilitate detection of the changed sequences. 

5 

Single-stranded DNA of an M13mpl9 clone, mpl9.7 (EP 201 239; Figure 2), 
containing the HSA cDNA was used as the template for the mutagenesis 
reactions using the In Vitro Mutagenesis System, Version 2 (Amersham 
International pic) according to the manufacturer's instructions. Individual 

10 plaques were selected and sequenced to confirm the presence of the mutations. 
Double stranded RF DNA was then made from clones with the expected 
changes and the DNA bearing the mutation was excised on an Xbal/Sacl 
fragment (Figure 1). This was used to replace the corresponding wild-type 
fragment of pAYE309 (EP 431 880; Figure 2). The presence of the mutated 

15 Xbal/Sacl fragment within the plasmid was checked by digesting with PvwII or 
Spel as appropriate. These Hindlll fragments were excised and inserted into 
the expression vector pAYE219 (Figure 1) to generate the plasmids pAYE464 
(alteration 1, R410A), pAYE470 (alteration 2, L407A, L408V, V409A) and 
pAYE468 (alteration 3, R410A, K413Q, K414Q). These expression plasmids 

20 comprise the 5. cerevisiae PRB1 promoter (WO 91/02057) driving expression 
of the HSA/MFcd leader sequence (WO 90/01063) fused in-frame with the 
mutated HA coding sequence which is followed by the ADH1 transcription 
terminator. The plasmids also contain part of the 2/im plasmid to provide 
replication functions and the LEU2 gene for selection of transformants. 

25 

pAYE464, pAYE470 and pAYE468 were introduced into S. cerevisiae DB1 
cir + (a 9 leu2\ Sleep et al, 1990) by transformation and individual transformants 
were grown for 3 days at 30°C in 10ml YEPS (1% w /v yeast extract, 2% w /v 
peptone, 2% w /v sucrose) and then the supernatants were examined by anti-HSA 
30 Western blot for the presence of the rHA fragment. The Western blots clearly 
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showed that fragment was still produced by the strains harbouring pAYE464 
although the level was reduced slightly compared to the control expressing 
wild-type rHA. The mutations in the plasmid pAYE470 appeared to have no 
effect on the generation of fragment. However, DB1 cir+ P AYE468 showed 
5 a novel pattern of HSA-related bands, with little or no fragment. 

One example of each of DB1 cir + pA YE464 and DB1 cir + P AYE468 were 
grown to high cell density by fed batch culture in minimal medium in a 
fermenter (Collins, 1990). Briefly, a fermenter of 10L working volume was 
10 filled to 5L with an initial batch medium containing 50 mL/L of a concentrated 
salts mixture (Table 1), 10 mL/L of a trace elements solution (Table 2), 50 
mL/L of a vitamins mixture (Table 3) and 20 g/L sucrose. An equal volume 
of feed medium containing 100 mL/L of the salts mixture, 20 mL/L of the 
trace elements mixture, 100 mL/L of vitamins solution and 500 g/L sucrose 
15 was held in a separate reservoir connected to the fermenter by a metering 
pump. The pH was maintained at 5.7 ± 0.2 by the automatic addition of 
ammomum hydroxide or sulphuric acid, and the temperature was maintained 
at 30°C. The stirrer speed was adjusted to give a dissolved oxygen tension of 
>20% air saturation at 1 v/v/min air flow rate. 



20 



Table 1. Salts Mixture 



25 



Che ™<*l 1 Cnnrp ntra t;™ ^ > 1 


KH,P0 4 


114.0 


MgS0 4 


12.0 


CaCl 2 .6H,0 


3.0 


L Na,EDTA 


2.0 
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Table 2. Trace Elements Solution 



10 



15 



20 



J Chemical 


Concentration (g/L) 


ZnS0 4 .7H,0 




FeSO,.7H,0 


10.0 


MnS0 4 .4H,0 




CuS0 4 .5H,0 


0.079 


H 3 B0 3 


1.5 


KI 


0.2 


Na,MoCL2H,0 


0J 


CoCl 2 .6H,0 


0.56 


H 3 P0 4 


75mL/L 


Table 3. Vitamins Solution 


Chemical 


Concentration (g/L) 


Ca pantothenate 


1.6 


Nicotinic acid 


1.2 


m inositol 


12.8 


Thiamine HC1 


0.32 


Pyridoxine HC1 


0.8 


1 Biotin 


0.008 



The fermenter was inoculated with 100 mL of an overnight culture of S 
25 cerensiae grown in buffered minimal medium (Yeast nitrogen base [without 
ammo adds, without ammonium sulphate, Difco] 1.7 g/L, (NH 4 ),S0 4 5 g/L 
atnc acid monohydrate 6.09 g/L, Na 2 HP0 4 20.16 g/L, sucrose 20 g/L 
PH6.5). The initial batch fermentation proceeded until the carbon source had 
been consumed, at which point the metering pump was switched on and the 
30 adduion of feed was computer controlled (the micro MFCS system, B Braun 
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Melsungen, Germany) using an algorithm based on that developed by Wang et 
al (1979). A mass spectrometer was used in conjunction with the computer 
control system to monitor the off gases from the fermentation and to control the 
addition of feed to maintain a set growth rate (eg 0.1 h' 1 ). Maximum 
5 conversion of carbon substrate into biomass is achieved by maintaining the 
respiratory coefficient below L2 (Collins, 1990) and, by this means, cell 
densities of approximately 100 g/L cell dry weight can be achieved. The 
culture supernatants were compared with those of a wild-type rHA producer by 
Coomassie-stained SDS/PAGE and by Western blot. These indicated (Figure 
10 2) that, whilst elimination of the monobasic Arg 410 (pAYE464) did reduce the 
level of the fragment by a useful amount, removal of both potential protease 
sites (pAYE468) almost abolished the 45kDa fragment. 

The above data suggested that the generation of rHA fragment might be due to 
15 endoproteolytic attack, though the absence of an effect of removal of the 
potential Kex2p site Lys 413 Lys 414 (Sleep et al, 1990, and confirmed by other 
studies not noted here) unless combined with elimination of Arg 410 , had 
suggested a complex etiology. The reduction in the amount of fragment with 
the mutated rHA could in principle be due to an effect of the changes on the 
20 kinetics of folding of the molecule and not due to the removal of protease 
cleavage sites. 

Example 2: Disruption of the YAP3 gene. 

25 The YAP3 gene encoding yeast aspartyl protease 3 was mutated by the process 
of gene disruption (Rothstein 1983) which effectively deleted part of the YAP3 
coding sequence, thereby preventing the production of active Yap3p. 

Four oligonucleotides suitable for PCR amplification of the 5' and 3' ends of 
30 the YAP3 gene (Egel-Mitani et al, 1990) were synthesised using an Applied 
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Biosystems 380B Oligonucleotide Synthesiser. To assist the reader, we include 
as SEQ15 the sequence of the YAP3 gene, of which 541-2250 is the coding 
sequence. 

5' end 

YAP3 A : 5 ' -CGTCAGACCTTGCATGCAGCCAAGACACCCTCACATAGC-3 ' 

(SEQ5) 

YAP3B : 5 ' -CCGTTACGTTCTGTGGTGGCATGCCCACTTCCAAGTCCACCG-3 ' 

(SEQ6) 

3' end 

YAP3C : 5 ' -GCGTCTCATAGTGGAAAAGCTTCTAAATACGACAACTTCCCC-3 ' 

(SEQ7) 

YAP3D : 5 ' -CCCAAAATGGTACCTGTGTCATCACTCGTTGGGATAATACC-3 ' 

(SEQ8) 

PCR reactions were carried out to amplify individually the 5' and 3' ends of 
the YAP3 gene from 5. cerevisiae genomic DNA (Clontech Laboratories, Inc). 
Conditions were as follows: 2.5/xg/ml genomic DNA, 5/xg/ml of each primer, 
20 denature at 94°C 61 seconds, anneal at 37°C 121 sees, extend at 72°C 181 
sees for 40 cycles, followed by a 4°C soak, using a Perkin-Elmer-Cetus 
Thermal Cycler and a Perkin-Elmer-Cetus PCR kit according to the 
manufacturer's recommendations. Products were analysed by gel 
electrophoresis and were found to be of the expected size. The 5' fragment 
25 was digested with Sphl and cloned into the Sphl site of pUC19HX (pUC19 
lacking a Hindlll site) to give pAYE51 1 (Figure 3), in which the orientation is 
such that YAP3 would be transcribed towards the Kpnl site of the pUC19HX 
polylinker. The 3' YAP3 fragment was digested with Hindlll and AspllS (an 
isoschizomer of Kpnl) and ligated into pUC19 digested with #/7zdIII//ij/>718 to 
30 give pAYE512. Plasmid DNA sequencing was carried out on the inserts to 
confirm that the desired sequences had been cloned. The HindUl/AspllS 
fragment of pAYE512 was then subcloned into the corresponding sites of 
pAYE511 to give pAYE513 (Fig 3), in which the 5' and 3' regions of YAP3 



10 



15 
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are correctly orientated with a unique Hindlll site between them. The URA3 
gene was isolated from YEp24 (Botstein et al, 1979) as a tfmdIII fragment and 
then inserted into this site to give pAYE515 (Fig 3), with URA3 flanked by the 
5' and 3' regions of YAP3, and transcribed in the opposite direction to YAP3. 

5 

A ura3 derivative of strain DB1 cir° pAYE316 (Sleep et al, 1991) was 
obtained by random chemical mutagenesis and selection for resistance to 5- 
fluoro-orotic acid (Boeke et al, 1987). The strain was grown overnight in 100 
mL buffered minimal medium and the cells were collected by centrifugation and 
10 then washed once with sterile water. The cells were then resuspended in 10 
mL sterile water and 2 mL aliquots were placed in separate 15 mL Falcon 
tubes. A 5 mg/mL solution of N-methyl-N'-nitro-N-nitrosoguanidine (NTG) 
was then added to the tubes as follows: 0 (iL, 20 fiL, 40 (iL, 80 /xL or 160 
nL. The cells were then incubated at 30°C for 30 min and then centrifuged 
15 and washed three times with sterile water. Finally, the cells were resuspended 
in 1 mL YEP (1% w/v yeast extract, 2% w/v Bacto peptone) and stored at 
4°C. The percentage of cells that survived the mutagenic treatment was 
determined by spreading dilutions of the samples on YEP plates containing 2% 
w/v sucrose and incubating at 30°C for 3 days. Cells from the treatment which 
20 gave approximately 50% survival were grown on YEP plates containing 2% 
w/v sucrose and then replica-plated onto YNB minimal medium containing 2% 
w/v sucrose and supplemented with 5-fluoro-orotic acid (1 mg/mL) and uracil 
(50 /ig/mL). Colonies able to grow on this medium were purified, tested to 
verify that they were unable to grow in the absence of uracil supplementation 
25 and that this defect could be corrected by introduction of the URA3 gene by 
transformation. One such strain, DBU3 cir° pAYE316, was transformed with 
the SphUAspm YAP3- URA3-YAP3 fragment of pAYE515 with selection for 
Ura + colonies. A Southern blot of digested genomic DNA of a number of 
transformants was probed with the 5' and 3' ends of the YAP3 gene and 
30 confirmed the disruption of the YAP3 gene. An anti-HSA Western blot of 
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YEPS shake-flask supernatants of two transformants indicated that disruption 
of YAP3 markedly reduced rHA fragment levels. 

One yap3 derivative of DBU3 cir° pAYE316, designated DXY10 cir° 
5 pAYE316, was grown several times by fed-batch fermentation in minimal 
medium to high cell dry weight. When supernatants were examined by 
Coomassie-stained PAGE and anti-HSA Western blot (Figs 4 and 5), the 
reduction in the level of rHA 45kDa fragment was clearly apparent; estimates 
of the amount of the degradation product vary from V 3 to V 3 of the levels seen 

10 with the YAP3 parent. The amount of rHA produced was not adversely 
affected by the yap3 mutation, indeed DXY10 cir° pAYE316 was found to 
produce 30-50% more rHA than the YAP3 equivalent, DB1 cir° pAYE316. 
Despite the fact that cleavage of the leader sequence from the HA sequence is 
C-terminal to a pair of basic residues, the rHA was found to have the correct 

15 N-terminus. 

The fermentation broth was centrifuged to remove the cells and then subject to 
affinity chromatographic purification as follows. The culture supernatant was 
passed through a Cibacron Blue F3GA Sepharose column (Pharmacia) which 
20 was then washed with 0.1M phosphate glycine buffer, pH8.0. The rHA was 
then eluted from the column with 2M NaCl, 0.1M phosphate glycine, pH8.0, 
at which point it was >95% pure. It may be purified further by techniques 
known in the art. 

25 The albumin may alternatively be purified from the culture medium by any of 
the variety of known techniques for purifying albumin from serum or 
fermentation culture medium, for example those disclosed in WO 92/04367, 
Maurel et al (1989), Curling (1980) and EP 524 681. 



WO 95/23857 



PCT/GB95/00434 



20 

Example 3: Disruption of the KEX2 gene in ayap 3 strain. 

To construct a strain lacking both Yap3p and Kex2p activity, a lys2 derivative 
of yeast strain DXY10 cir» (pAYE316) was obtained by random chemical 
5 mutagenesis and selection for resistance to a-amino adipate (Barnes and 
Thorner, 1985). Cells were mutagenic as in Example 2 and then plated on 
YNB minimal medium containing 2% w/v sucrose and supplemented with 2 
mg/mL DL-a-amino adipate as the sole nitrogen source and 30 M g/mL lysine 
Colonies able to grow on this medium were purified and tested to verify that 
10 they were unable to grow in the absence of lysine supplementation and that this 
defect could be corrected by the introduction of the LYS2 gene by 
transformation. This strain was then mutated by the process of gene disruption 
winch effectively disrupted part of the KEX2 coding sequence, thereby 
prevents production of active Kex2p. To assist the reader, the sequence of 
15 the KEX2 gene is reproduced herein as SEQ14, of which 1329-3773 is the 
coding sequence. 

Four oligonucleotides suitable for PGR amplification of the 5' and 3' ends of 
the KEX2 gene (Fuller et al, 1989) were synthesised using an Applied 
20 Biosystems 380B Oligonucleotide Synthesiser. 

5' end 

KEX2A : ^ ' -CCATCTGGATCCAATGGTGCTTTGGCCAAATAAATAGTTTCAGC-3 ' 
25 KEX2B: 5-GCTTCTTTTACCGGTAACAAGCTTGAGTCCATTGG-3' 

(SEQ10) 

3' end 

3o KEX2C: 5 ' -GGTAAGGTTTAGTCGACCTATTTTTTGTTTTGTCTGC-3 < 

KEX2D: 5 '^GAAACGTATGAATTCGATATCATTGATACAGACTCTGAG^-3 1 ' 

(SEQ12) 
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15 



20 



25 



PCR reasons were carried ou, t „ amplify individually lhe s . a „ d 3, ends rf 
the KEX2 gene from S. cerensiae genomic DNA (Clor„ech Laboratories l„c) 
Conditions were as fol.ows: 2.5 „g/ ml genomic DNA , 5 ^ rf ^ 
pnmer, denature 94°C 61s, anneal 37«C 121s, extend 72°C 181s for 40 cycles 
5 followed by a 4'C soak, using . Perkin-E.mer-Cetus Thermal Cycler and a 

PerMn-Elmer-CetusPCRkitaccordingtorhemannfacturer^commendations 
Produce were analysed by gel electrophoresis and were found ,„ be of the 
expected size (0.9 kb for the 5' product and 0.62 kb for the 3' product) The 
5' product was digested with BamHl and HMIU and the 3' product was 
dtgested with HMUl and Sail and then the two fragment were together cloned 
.nto pUC19HX digested with BanMl and San. A 4.8 kb *„dl„ fragment 
composing me S. Cerent LYS2 gene (Barnes & Thon™, 1985) was men 
■nserted .nto the resulting p l asraid at HMm fie ^ ^ ^ 

fragments) to form pAYE519 (Fig 6). 

The fys2 derivative of DXV.O cir- (pAYE3.6), fys2-,6, was Wormed with 
6.0 kb KEX2-LYS2-KEX2 fragment of pAYE519, selecting for Lys* 
monies. A Southern blot of digested genomic DNA of a number of 
transformants was probed with the 5' and 3' ends of the KEX2 gene and 
confirmed the disruption of the KEX2 gene. An anti-HSA Western b.o, of 
YEPS shake-nask culture supernatant* of these transformants indicated that 
dtsnrption of KEX2 in >yap3 strain ^ ^ rf ^ ^ 

fi.rti.er, despite me lack of an effect of disruption of ffi.2a.one in Example 
4 below. Analysis of the rHA produced by one such strain, ABB50, indicated 
that the leader sequence was incorrectiy processed, Lading to an abnormal N- 

terminus. 



Thestrain ABB50 (pAYE316) was cured of its plasmid (Sleep e, a/, 1991)and 
transformed with a simi.ar plasmid, P AYE522, in which the hybrid leader 
sequence was repfaced by the S. cnUa. inverse { SVC2) .eader sequence 



WO 95/23857 



PCT/GB95/00434 



22 

such that the encoded leader and the junction with the HSA sequence were as 
follows: 



10 



15 



MLLQAFLFLLAGFAAKISA I DAHKS (SE Q13) 
Invertase leader HSA 

In mis consm,c, cleavage of the leader sequence from HSA does no, rely upon 
activt,y of ,he Ke*2 protease. The strain ABB50 (pAYE522) was found to 
produce rHA with a simply very low leve. of rHA fragmen,, but in this 
stance the N-terminus corresponded to to of serunwierived HSA, ie there 
was efficient and precise removal of the leader sequence. 

Example 4: Disruption of .he KEX2 gene a,„ ne (Comparative Example). 

By a sintilar method to that disclosed in Example 3 the KEX2 gene was 
dtsntpted in S. ce misiae . m s strai „ ^ me Yap3p 
was therefore „o, within the scope of the invention. When this strain was 
Srown in fed batch fermentation the rHA produced contained similar amounts 

^i-nttotoproducedbystiainswithanintact^gene. .naddition 
20 the overall level ofrHA was reduced and the leader sequence was no, correcU 
processed, leading lo an abnormal N-terminus. 

Example S: Identification of equivalent protease in Pichia. 

25 As now above, ^-Saccha ronyces yeas, similarly produce U,e undesirable 
fragment of rHA and therefore have the Yap3 P proteolytic activity. We have 
confirmed mis by performing Sou<her„ hybridisations of Pichia mgU s,a DNA 
usmg me S. cerevisiae YAP3 gene as a probe. A specific DNA fragmen, was 
.denttfied, showing mat, no, only is ,he Yap3 P proteolytic activity present in 
P. angusia, b„, a specific nomologue of me YAP3 gene is present aJso 
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The method of Southern hybridization used for detection of the YAP3 
homologue can be adapted to clone the gene sequence from a genomic DNA 
library of Pichia DNA using standard procedures (Sambrook et al, 1989). 
Disruption of the YAP3 homologue in Pichia sp. can be achieved using similar 
5 techniques to those used above for Saccharomyces (Cregg and Madden, 1987). 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Delta Biotechnology Limited 

(B) STREET: Castle Court, Castle Boulevard 

(C) CITY: Nottingham 

(D) STATE: Nottinghamshire 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP) : NG7 1FD 

(ii) TITLE OF INVENTION: Yeast strains and modified albumins 

(iii) NUMBER OF SEQUENCES: 15 

<iv) COMPUTER READABLE FORM: 

<A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1830 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 73-. 1827 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

ATGAAGTGGG TAAGCTTTAT TTCCCTTCTT TTTCTCTTTA GCTCGGCTTA TTCCAGGAGC 60 

TTGGATAAAA GA GAT GCA CAC AAG AGT GAG GTT GCT CAT CGG TTT AAA 108 
Asp Ala His Lys Ser Glu Val Ala His Arg Phe Lys 
15 10 

GAT TTG GGA GAA GAA AAT TTC AAA GCC TTG GTG TTG ATT GCC TTT GCT 156 
Asp Leu Gly Glu Glu Asn Phe Lys Ala Leu Val Leu lie Ala Phe Ala 
15 20 25 

CAG TAT CTT CAG CAG TGT CCA TTT GAA GAT CAT GTA AAA TTA GTG AAT 204 
Gin Tyr Leu Gin Gin Cys Pro Phe Glu Asp His Val Lys Leu Val Asn 
30 35 . 40 

GAA GTA ACT GAA TTT GCA AAA ACA TGT GTT GCT GAT GAG TCA GCT GAA 252 
Glu Val Thr Glu Phe Ala Lys Thr Cys Val Ala Asp Glu Ser Ala Glu 
45 50 55 60 



300 



348 



396 



444 
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£ £ a a 2 £ S a 2 s £ a - ™ ~ - 

65 ™* As P Lys Leu Cys Thr 

" S £ 2 a Si S a a a S a a « » « 

80 yr G *£ Glu Met Ala Asp C ys Cys Ala 

90 

* a a a a 2- a a - - = c c« K « „ « 

S S K 2 S 25 2 S a - - - 5 - - » 

120 y 

S S 2 2 a £ - - s » - r r rac m IM 

130 Phe Jj? Lys Tyr Leu Tyr 

si s s 2; a a a ~ » » s « « ctc m « 

145 y Me Ala Pro Glu Leu Leu Phe 

155 

s = a a 5 a a a a a s a a s - s 

5 ^ £ = " = = 5 « 2! a 2: s S 5 2 

185 

*£===SK222aa2aaa 
£ - 2; a a a 2525 a a a a a s 

CAG AGA TTT nnr* « 220 

- - - a a a a » s « « « «. » 5S 

235 

a 2 a a « a a a a a a a a a - - 

240 2 U ° ly As P Leu Leu Glu 

250 

a a a a a a a a a a a s a « •» - 

255 260 y 116 Cys Glu Asn Gin 

265 

GAT TCG ATC TCC ACT AAA PTr aar^ 

- - «. - _ a a a a a a a a a a a 

a a a a a s a a a a a a «■ « «* « 

28S 290 al 01 " Jg "P Olu «.t Pro M . 

aaaaaaaaaaaaaaaS 
a a 5 a a a a a a a a a a a a a 



492 
540 
588 
636 
684 
732 
780 
828 
876 
924 
972 
1020 
1068 
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320 325 330 

GAA TAT GCA AGA AGG CAT CCT GAT TAC TCT GTC GTG CTG CTG CTG AG A 1116 
Glu Tyr Ala Arg Arg His Pro Asp Tyr Ser Val Val Leu Leu Leu Arg 
335 340 345 

CTT GCC AAG ACA TAT GAA ACC ACT CTA GAG AAG TGC TGT GCC GCT GCA 1164 
Leu Ala Lys Thr Tyr Glu Thr Thr Leu Glu Lys Cys Cys Ala Ala Ala 
350 355 360 

GAT CCT CAT GAA TGC TAT GCC AAA GTG TTC GAT GAA' TTT AAA CCT CTT 1212 
Asp Pro His Glu Cys Tyr Ala Lys Val Phe Asp Glu Phe Lys Pro Leu 
365 370 375 380 

GTG GAA GAG CCT CAG AAT TTA ATC AAA CAA AAT TGT GAG CTT TTT GAG 1260 
Val Glu Glu Pro Gin Asn Leu lie Lys Gin Asn Cys Glu Leu Phe Glu 
385 390 395 

CAG CTT GGA GAG TAC AAA TTC CAG AAT GCG CTA TTA GTT CGT TAC ACC 1308 
Gin Leu Gly Glu Tyr Lys Phe Gin Asn Ala Leu Leu Val Arg Tyr Thr 
400 405 410 

AAG AAA GTA CCC CAA GTG TCA ACT CCA ACT CTT GTA GAG GTC TCA AGA 1356 
Lys Lys Val Pro Gin Val Ser Thr Pro Thr Leu Val Glu Val Ser Arg 
415 420 425 

AAC CTA GGA AAA GTG GGC AGC AAA TGT TGT AAA CAT CCT GAA GCA AAA 1404 
Asn Leu Gly Lys Val Gly Ser Lys Cys Cys Lys His Pro Glu Ala Lys 
430 435 440 

AGA ATG CCC TGT GCA GAA GAC TAT CTA TCC GTG GTC CTG AAC CAG TTA 1452 
Arg Met Pro Cys Ala Glu Asp Tyr Leu Ser Val Val Leu Asn Gin Leu 
445 450 455 460 

TGT GTG TTG CAT GAG AAA ACG CCA GTA AGT GAC AGA GTC ACC AAA TGC 1500 
Cys Val Leu His Glu Lys Thr Pro Val Ser Asp Arg Val Thr Lys Cys 
465 470 475 

TGC ACA GAA TCC TTG GTG AAC AGG CGA CCA TGC TTT TCA GCT CTG GAA 1548 
Cys Thr Glu Ser Leu Val Asn Arg Arg Pro Cys Phe Ser Ala Leu Glu 
480 485 490 

GTC GAT GAA ACA TAC GTT CCC AAA GAG TTT AAT GCT GAA ACA TTC ACC 1596 
Val Asp Glu Thr Tyr Val Pro Lys Glu Phe Asn Ala Glu Thr Phe Thr 
495 500 505 

TTC CAT GCA GAT ATA TGC ACA CTT TCT GAG AAG GAG AGA CAA ATC AAG 1644 
Phe His Ala Asp lie Cys Thr Leu Ser Glu Lys Glu Arg Gin lie Lvs 
510 515 520 

AAA CAA ACT GCA CTT GTT GAG CTC GTG AAA CAC AAG CCC AAG GCA ACA 1692 
Lys Gin Thr Ala Leu Val Glu Leu Val Lys His Lys Pro Lys Ala Thr 
525 530 535 540 

AAA GAG CAA CTG AAA GCT GTT ATG GAT GAT TTC GCA GCT TTT GTA GAG 1740 
Lys Glu Gin Leu Lys Ala Val Met Asp Asp Phe Ala Ala Phe Val Glu 
545 550 555 

AAG TGC TGC AAG GCT GAC GAT AAG GAG ACC TGC TTT GCC GAG GAG GGT 1788 
Lys Cys Cys Lys Ala Asp Asp Lys Glu Thr Cys Phe Ala Glu Glu Gly 
560 565 570 

AAA AAA CTT GTT GCT GCA AGT CAA GCT GCC TTA GGC TTA TAA 1830 
Lys Lys Leu Val Ala Ala Ser Gin Ala Ala Leu Gly Leu 
575 580 585 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 585 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Asp Ala His Lys Ser Glu Val Ala His Arg P he Lys Asp Leu Gly Glu 

«. ». Phe Ma _ w Leu , >la ^ ^ ^ ^ 

^ 30 
«• Cys Pro Phe „. » 5p His vj; Ly3 val ^ ^ v ^ ^ 

Ph. »l. Lye Ihr c, v.! «. »«„ G1 „ s „ iU J' ^ 

60 

Ser Leu His Thr Leu Phe Gly Asp Lys Leu Cys Thr Val ^ ^ 

75 80 
»r 9 0lu „ Iyr 0 , y ». MM Ma Jsp ^ ^ ju ^ p r) j 

«1. te, «.» «„ cys Ph. ^ eln „ ly . a , p Mp ^ J* 

105 110 
P~ »ro jj. „ al Sto Pro Gl „ ^ v ^ ^ ^ ^ Mi 

125 

» S p a ». 01u Ihr Phe ^ lys Lys ^ ^ ^ ciu ue Ma ^ 

gj p„ n r Ph. jg Ml Pto 0l „ teu 2° pte au 

T* Ly. M. » la , h . Ihr ol „ cys cys su ui ^ ^ ^ 

Cys Leu Leu Pro Lys Leu Asn n„ t« * 

180 7 ^ ASP G1U ^ Ar 9 Asp Glu Gly Lys Ala Ser 

AU5 190 
«. jg Gln _ _ lys Ma ^ ^ ^ 

^ UU 205 
Arg Ala Phe Lys Ala Trp Ala Val ai* » 

210 rp }J| Vai Ala Arg Leu Ser Gin Arg Phe Pro 

220 

g. «• «u Phe » la Vli s „ Lye leu ^ ^ u l ^ 

235 240 

" ^ a ^ " 1S ^ S «. eye ,u «. p Mp 

*"° 255 
Arg Ala Asp Leu Ala Lys Tyr He Cv« ri.. » 

260 7 7 IiS ^ Asn Gln As P Ser He Ser 

265 270 

Ser Lys Leu Ly S Glu Cy6 Cys «« Lys P ro Leu Leu Glu Lys Ser His 
Cys He Ala Glu Val Glu Asn Asp Glu Met P ro Ala Z Leu Pro Ser 
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290 295 

^" 300 

Leu Ala Ala Asp Phe Val n., t 

305 PP hS HI Glu Ser ^ Asp Val Cys Lys Asn Tyr Ala 

315 320 
«• ». ^ ».p jg Phe Leu Gly „„ ^ ^ ^ ju 

J30 335 
Arg His Pro Asp Tyr Ser Val VaI r« r 

340 al Leu Leu Ara Ala Lys Thr 

J " 350 

Tyr Glu Thr Thr Leu Glu Lys Cys Cvs Ala *i a &i * 

355 y 3^ Cys Ala Ala Ala Asp Pro His Glu 

365 

Cys Tyr Ala Lys Val phe ^ Glu phe ^ ^ ^ ^ ^ ^ ^ 
«ln Asn Leu n. Lys s „ Asn cys Glu 

395 400 
Tyr Lys Phe Gin Asn Ala Leu Leu Val Arq Tvr Thr t t 

405 ^ Tnr L y s L ys Val Pro 

410 415 
Gin Val Ser Thr Pro Thr Leu Val Glu Val ^ * * 

420 Val Ser Ar S Asn Leu Gly Lys 

4 ^ 430 
val Gly ser Lys Cys cys Lys « is Pro Glu Ala Lys Arg ^ pro 

440 445 
Ma Glu Asp Tyr Leu ser Ja i Val Leu Asn Gln ^ ^ ^ ^ ^ 

Ciu Ly6 Thr Pro val ser Asp Arg Val Thr ^ ^ ^ ^ ^ 

T 475 480 

«- V! A S „ A,, jj, Pro Cy „ p „ e s „ ^ v ^ ^ ^ 

450 495 

^ v sl P „ g. Glu Phe as „ Ma ^ ar ^ ^ 

SUb 510 
lie cys Thr Leu Ser Glu Lys Glu Arc Gin ti- t 

515 7 9 16 Lys Gln T hr Ala 

525 

- V.1 OXu u. V.X lys g ty6 p „ Ly . mr 

OJ:5 540 

a «. » « ..p jg Pte Ma „. Phe Glu iys cys cys 

560 

«. **P MP Lys «. Thr c y . Ph . A1 , jjj clu Gly Lye Lys ^ 

Ala Ala Ser Gln Ala Ala Leu Gly Leu 
580 585 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



570 5?5 
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(iii) HYPOTHETICAL : NO 
(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 3, 

Arg Tyr Thr Lys Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS • 

li\ S? H: 5 araino aci * s 

' » TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 4: 
Ala Tyr Thr Gin Gin 
1 S 

(2) INFORMATION FOR SEQ ID NO: 5, 
(i) SEQUENCE CHARACTERISTICS: 

IHJ TYPE: nucleic acid 

(C) STRANDEDNESS: sinale 

(D) TOPOLOGY: linefr 9 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi, SEQUENCE DESCRIPTION: SEQ ID NO- 5- 
CGTCAGACCT TGCATGCAGC CAACACACCC TCACATAGC ' 
(2, INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS • 

' ' WPEs nucleic acid 

^RANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic, 



39 
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(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CCGTTACGTT CTGTGGtGGC ATGCCCACTT CCAAGTCCAC CG 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GCGTCTCATA GTGGAAAAGC TTCTAAATAC GACAACTTCC CC 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CCCAAAATGG TACCTGTGTC ATCACTCGTT GGGATAATAC C 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CCATCTGGAT CCAATGGTGC TTTGGCCAAA TAAATAGTTT CAGC 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GCTTCTTTTA CCGGTAACAA GCTTGAGTCC ATTGG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGTAAGGTTT AGTCGACCTA TTTTTTGTTT TGTCTGC 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGAAACGTAT GAATTCGATA TCATTGATAC AGACTCTGAG TACG 
(2) INFORMATION FOR SEQ ID NO: 13: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Leu Leu Gin Ala Phe Leu Phe Leu Leu Ala Gly Phe Ala Ala Lys 
15 10 15 

lie Ser Ala Asp Ala His Lys Ser 
20 

(2) INFORMATION FOR SEQ ID NO: .14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4106 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GAATTCTCTG TTGACTACTA AACTGAGAGA ATTTGCCGAG ACTCTAAGAA CAGCTTTGAA 60 

AGAGCGTTCT GCCGATGATT CCATAATTGT CACTCTGAGA GAGCAAATGC AAAGAGAAAT 120 

CTTCAGGTTG ATGTCGTTGT TCATGGACAT ACCTCCAGTG CAACCAAACG AGCAATTCAC 180 

TTGGGAATAC GTTGACAAAG ACAAGAAAAT CCACACTATC AAATCGACTC CGTTAGAATT 240 

TGCCTCCAAA TACGCAAAAT TGGACCCTTC CACGCCAGTC TCATTGATCA ATGATCCAAG 300 

ACACCATATG GTAAATTAAT TAAGATCGAT CGTTTAGGAA ACGTCCTTGG CGGAGATGCC 360 

GTGATTTACT TAAATGTTGA CAATGAAACA CTATCTAAAT TGGTTGTTAA GAGATTACAA 420 

AATAACAAAG CTGTCTTTTT TGGATCTCAC ACTCCAAAGT TCATGGACAA GAAAACTGGT 480 

GTCATGGATA TTGAATTGTG GAACTATCCT GCCATGGCTA TAATTTACCT CAGCAAAAGG 540 

CATCCGGTAT TAGATACCAT GAAAGTTTGA TGACTCATGC TATGTTGGAT CACTGGCTGC 600 

CACGTCGATG AAACGTCTAA ATTACCACTT CGCTACCGTC TGAAAATTCC TGGGGTAAAG 660 
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ACTCCGGTAA 


AGACGGATTA 


TACGTGATGA 


CTCAAAAGTA 


CTTCGAGGAG 


TACTGCTTTC 


720 


AAATTGTGGT 


CGATATCAAT 


GAATTGCCAA 


AAGAGCTGGC 


TTCAAAATTC 


ACCTCAGGTA 


780 


AGGAAGAGCC 


GATTGTCTTG 


CCCATCTGGA 


CCCAATGGTG 


CTTTGGCCAA 


ATAAATAGTT 


840 


TCAGCAGCTC 


TGATGTAGAT 


ACACGTATCT 


CGACATGTTT 


T ATTTTT A CT 

x * * * x x x x n \^ x 


ATAPATAPAT 




AAAAGAAATA 


AAAAATGATA 


ACGTGTATAT 


TATTATTCAT 


AT AATC A A TG 
n ■*> «*n x wnn x vj 


AGGGTPATTT 
nuvau X ^xl XXX 


q An 


TCTGAAACGC 


AAAAAACGGT 


AAATGGAAAA 


AAAATAAAGA 


TAG A A A A AG A 


A A APA A APA A 


1 no rt 

lUzU 


AGGAAAGGTT 


AGCATATTAA 


ATAACTGAGC 


TGATACTTCA 


ACAGPATPGP 


TGA AG AP A BP 


i nan 


AG T ATTG AAA 


CCGAAACATT 


TTCT AAAGG C 


AAACAAGGTA 


PTPPAT ATTT 
x w*v^n x n xxx 


flfTPr APPTP 


1 1 ah 


TTCTTTCTCT 


CGTTTCATAT 


GCATAATTCT 


GTCATAAGCC 


TGTTCTTTTT 
x vj x x^«xxxxx 


PPTGGPTT A A 




ACATCCCGTT 


TTGTAAAAGA 


GAAATCTATT 


CCACATATTT 
wv^nv^nxnx x x 


P ATTP ATTPG 
wri x x \*t\ X X 


PPT A PP AT A P 
ULlnLLninL 


XZDU 


TAAGGATAAA 


CTAATCCCGT 


TGTTTTTTGG 


CCTCGTCACA 


TA ATT AT A A A 
x/xnx Inlnnn 




i ion 


ATTATCAGAT 

*» • X *m X vrivn X 


GAAAGTGAGG 


A A AT AT ATT A 


wX 1 IHibtl x 


^*p/v r*r* r*r* 
x TGG IGGG CC 


TTTTCAACAT 


1380 


CCGPTPTTGT 

w W\J w- X \+ X X VJ X 


ATCATPACAA 
xi x \*n x v-ri unn 


PAAATTPPAT 




TACGTCACGA 


CAGTATTTTG 


1440 


PTGTAGAAAG 


PA ATGA AAPA 
v*nn x unnnVf n 


X X /\ J. l_ l*. k^Vj X 


x bGAGGAAAT 


GCATCCAAAT 


TGGAAATATG 


1500 


A A P A TG A TG T 
nn k-n x on x \» J. 


TPGAGGGPTA 


OOnxin ^Cn 1 x 


ATGTTTTTTC 


AAAAGAGTTG 


CTAAAATTGG 


1560 


G P A A A A G A T P 


ATPATTAP A A 
ni On x X nunn 


unb x X ALAuu 


GGGATAACAA 


CGACCACATA 


TTATCTGTCC 


1620 


ATGATTTATT 


CCCG PGT A A P 


GAPPTATTTA 




GGTGCCTGCT 


CCACCAATGG 


1680 


ACTCAAGCTT 


GTTACCGGTA 


AA AGAAGPTG 


A GG A T A A A r"T 
r\Kj\3f\ x nnnL. 1 


UAGLAi AAAT 


GATCCGCTTT 


1740 


TTGAGAGGCA 


GTGGCACTTG 


GTCAATCCAA 


GTTTTPPTGG 

w X X X X x 


wn« x v>/i 1 iiX A 


AATGTT CTTG 


1800 


ATCTGTGGTA 


CAATAATATT 


AC AGGCG C AG 


GGGTPGTGGP 


TGPPATTPTT 


GAlbAiGGCC 


I860 


TTGACTACGA 


AAATGAAGAC 


TTGAAGGATA 


ATTTTTGP.GC 

«»X A X X A SJ 


TG A AGGTTPT 


x VjVj^yn J. x 1 L-n 


t oon 


ACGACAATAC 


CAATTTACCT 


AAACCAAGAT 


TATPTGATGA 


PTAPPATGPT 


AUGAGATGTG 




CAGGTGAAAT 


AGCTGCCAAA 


AAAGGTAACA 


ATTTTTGPGG 
nx xxx xwvAvVa 


TGTPGGGPTA 


bbl xACAACG 


2040 


CTAAAATCTC 


AGGCATAAGA 


ATCTTATCCG 


GTGATATCAP 


TAPGG A AG AT 


GAAGCTGCGT 


ZlUU 


CCTTGATTTA 


TGGTCTAGAC 


GTAAACGATA 


T AT AT TC ATG 


CTC ATGG GG T 


PPPGPTP A*PP 




ACGGAAGACA 


TTTACAAGGC 


CCTAGTGACC 


TGGTGAAAAA 


GGCTTTAGTA 


AAAGGTGTTA 
nnnuu x w X X n 


o oon 


CTGAGGGAAG 


AGATTCCAAA 


GGAGCGATTT 


ACGTTTTTGC 


CAGTGGAAAT 


GGTGGAACTC 


^ ^ o V 


GTGGTGATAA 


TTGCAATTAC 


GACGGCTATA 


CTAATTCCAT 


ATATTCTATT 


APT A TTGGGG 




V- X Al rGATCA 


CAAAGATCTA 


CATCCTCCTT 


ATTCCGAAGG 


TTGTTCCGCC 


GTCATGGCAG 


2400 


TCACGTATTC 


TTCAGGTTCA 


GGCGAATATA 


TTCATTCGAG 


TGATATCAAC 


GGCAGATGCA 


2460 


GTAATAGCCA 


CGGTGGAACG 


TCTGCGGCTG 


CTCCATTAGC 


TGCCGGTGTT 


TACACTTTGT 


2520 


TACTAGAAGC 


CAACCCAAAC 


CTAACTTGGA 


GAGACGTACA 


GTATTTATCA 


ATCTTGTCTG 


2580 


CGGTAGGGTT 


AGAAAAGAAC 


GCTGACGGAG 


ATTGGAGAGA 


TAGCGCCATG 


GGGAAGAAAT 


2640 
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ACTCTCATCG 


CTATGGCTTT 


GGTAAAATCG 


ATGCCCATAA 


GTTAATTGAA 


ATP.TPPA AH A 
niul utnnun 


9 7fin 


CCTGGGAGAA 


TGTTAACGCA 


CAAACCTGGT 


TTTACCTGCC 


AACATTGTAT 


fiTTTPPPAP.T 
will V/VyCnv X 




CCACAAACTC 


CACGGAAGAG 


ACATTAGAAT 


CCGTCATAAC 


CATATCAGAA 


AAAAP.TPTTP 
AAnnu l Ul 1 v< 




AAGATGCTAA 


CTTCAAGAGA 


ATTGAGCACG 


TCACGGTAAP 

x wnwuu x nxiv 


TP.TAP.ATATT 
luinuninl X 


PTATTiPTAPTA7A7A 

G A X AGAGAAA 


noon 


TTAGGGGAAC 


TACGAPTP.TP 


fl A TTT A A T A T 
unl 1 Innlnl 




P 7A T7A T\ T^' T«M'/"^ TV 

GAXAATTTCA 


AACCTTGGCG 


2940 


1 IVjl/UlwAUU 


rvnurlu A lull 


TP TA TP 7A P JA P P 


P J\ T'^P TV TV TV TV 

G AT x CAAAGA 


^^^W^^^^^t ^^*t Mil 11^^ 

CTGG AC ATT C 


ATGTCTGTAG 


3000 


CACATTGGGG 


TP. A P. A APfifiP 


PTAPPTPATT 
u 1 nuu X un 1 X 


pp TV TA TA TV *T*P TV Tv 


GGTTAAGACA 


acagaaaatg 


3060 


GACACAGGAT 


TP.APTTPPAP 


nui iuunuut 


X G AAGCTCTT 


TGGGGAATCC 


ATTGATTCAT 


3120 


CTAAAACAGA 


AACTTTCGTC 


TTTP.P.AAAPP. 
i 1 xuunnrliwu 


AT A A AP 7i fir* 7a 
A X nnnunuun 


rrT*rr ta tv r*r* tv 
bGTTGAACCA 


G CTG CT AC AG 


3180 


AAAGTACCGT 


ATCACAATAT 


TCTGCCAGTT 
i w i uv^uno 1 1 


PA A (""FTP TAT 


X XGGAXGAGC 


GCTACTTCTA 


3240 


CATCTTCTAT 


CTCAATTGGT 


GTGGAAACGT 

u x vjuiuin^u x 


^.uul^l—rll IwL 


PP TA 7A 7A PP TAPT 


ACTGCGAGTA 


3300 


CCGATCCTGA 


TTCTGATCCA 


AACACTCCTA 


AAAAAPTTTP 
nftn/inv 11 1^ 


PTPTPPT TAP P 
Lltl GG X nub 


P Ji 7A P PP Tv TP /■*» 

CAAG CCATG C 


J JbU 


ATTATTTTTT 


AACAATATTT 


TTGATTGGCC 


pp AP A TTTTT 
ULnUnl 1111 


GG lb i X ATAC 


TTCATGTTTT 


3420 


TTATGAAATC 




ATPAP-iaPPT 


TV Tv /"» TV P /"^ /"*/"* Tv 

C AAG AG CGG A 


AA CG T ATG AA 


TTCGATATCA 


3480 


TTP. & T A P AP & 
x 1 uA 1 nLnun 


UILI GAG I AC 


GATTCTACTT 


TGGACAATGG 


AACTTCCGGA 


ATTACTGAGC 


3540 


p^O T\ TV TV P P T> 

GGG AAG AGG T 


TGAGGACTTC 


GATTTTGATT 


TGTCCGATGA 


AGACCATCTT 


GCAAGTTTGT 


3600 


CTTCATCAGA 


AAACGGTGAT 


GCTGAACATA 


CAATTGATAG 


TGTACTAACA 


AACGAAAATC 


3660 


v-»A J. J. 1 AG X G A 


CCCTATAAAG 


CAAAAGTTCC 


CAAATGACGC 


CAACGCAGAA 


TCTGCTTCCA 


3720 


AT AAAT T AC A 

* • x nnn x x 


nunn 1 1 Al* Au 


pp*p TV «T>/^ T»T»/"^ 

LLiVjAIuI XC 


CTCCATCTTC 


CGGACGATCG 


TGATTCGATA 


3780 


TGTACAGAAA 




rAPTATATATAfTAPP 
n^ftftftfti AGO 


ATTTTTTTCT 


TATAGATTAT 


AATACTCTCT 


3840 


CATACGTATA 


CP.TATATP.TP. 
i ni /\ lulu 


T 7A T 7A Tft BTRT 
X A 1 A X un X A X 


7\'T , TVTvTV/"*TVTv'JV/'"* 

AT AAAC AAA C 


ATTAATATCC 


TATTCCTTCC 


3900 


GTTTGAAATC 


CCTATGATGT 


ACTTTGCATT 


GTTTGCACCC 


GCGAATAAAA 


TGAAAACTCC 


3960 


GAACCGATAT 


ATCAAGCACA 


TAAAAGGGGA 


GGGTCCAATT 


AATGCATATT 


TAAGACCACA 


4020 


GCTGAATAAC 


TTTAAAACGG 


CAGACAAAAC 


aaaaaatagg 


TCGAATAAAC 


CTTACCTGCC 


4080 


TAGAAGGAAT 


GACAGCAGCT 


AATAAG 








4106 


(2) INFORMATION FOR SEQ ID NO: 15 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2526 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



CCGTTTTCTT 


TTCGTAAAAA 


AAAACAATAG 


ACACTATATA 


TAGACACTTT 


TTCCTTTCCT 


60 


TCTTTGCGCG 


ATTTCAAGAG 


GAAAAGCATA 


CTTAAATAAG 


AATATTCCTA 


AAACACACGT 


120 


TCTGACGCGT 


CAATTAGATC 


GTCAGACCTT 


GCATGCAGCC 


AAGACACCCT 


CACATAGCAC 


180 


TGCCTCCTTC 


CTCCTCTTTT 


CTGTCACCAC 


CTCACCTCCC 


TCGTCCACTC 


AACTGAGTGG 


240 


CTTTTCGCTC 


CTTTTATACT 


GCGCCATGAG 


TAGTTTTCGT 


TTCACTGATG 


TGTCCGAAAA 


300 


AATTGAGGTT 


TCATAAAAAA 


ATTCGTGGAC 


TTATTTATGG 


AGAAACAGGG 


AAATCCGACT 


360 


ACTTAAGAAA 


AGGGTGTCAA 


AGAGGATTTA 


CTTTTTTCCT 


TCTTTTTGCA 


TTTGTTCCTA 


420 


TTTCCGCAAT 


TGGACGGTTA 


TTAAGAAGAA 


CGCAATTGGC 


TTTTCTGTAT 


ATTAAAATAC 


480 


ATAGCGTAAT 


AAAAAGATAA 


GGTGAACACC 


AAG CAT AT AG 


TATAATATTA 


CCTACCACAT 


540 


ATGAAACTGA 


AAACTGTAAG 


ATCTGCGGTC 


CTTTCGTCAC 


TCTTTGCATC 


GCAGGTTCTC 


600 


GGTAAGATAA 


TACCAGCAGC 


AAACAAGCGC 


GACGACGACT 


CGAATTCCAA 


GTTCGTCAAG 


660 


TTGCCCTTTC 


ATAAGCTTTA 


CGGGGACTCG 


CTAGAAAATG 


TGGGAAGCGA 


CAAAAAACCG 


720 


GAAGTACGCC 


TATTGAAGAG 


GGCTGACGGT 


TATGAAGAAA 


TTATAATTAC 


CAACCAG CAA 


780 


AGTTTCTATT 


CGGTGGACTT 


GGAAGTGGGC 


ACGCCACCAC 


AGAACGTAAC 


GGTCCTGGTG 


840 


GACACAGGCT 


CCTCTGATCT 


ATGGATTATG 


GGCTCGGATA 


ATCCATACTG 


TTCTTCGAAC 


900 


AGTATGGGTA 


GTAGCCGGAG 


ACGTGTTATT 


GACAAACGTG 


ATGATTCGTC 


AAGCGGCGGA 


960 


TCTTTGATTA 


ATGATATAAA 


CCCATTTGGC 


TGGTTGACGG 


GAACGGGCAG 


TGCCATTGGC 


1020 


CCCACTGCTA 


CGGGCTTAGG 


AGGCGGTTCA 


GGTACGGCAA 


CTCAATCCGT 


GCCTGCTTCG 


1080 


GAAGCCACCA 


TGGACTGTCA 


ACAATACGGG 


ACATTTTCCA 


CTTCGGGCTC 


TTCTACATTT 


1140 


AGATCAAACA 


ACACCTATTT 


CAGTATTAGC 


TACGGTGATG 


GGACTTTTGC 


CTCCGGTACT 


1200 


TTTGGTACGG 


ATGTTTTGGA 


TTTAAGCGAC 


TTGAACGTTA 


CCGGGTTGTC 


TTTTGCCGTT 


1260 


GCCAATGAAA 


CGAATTCTAC 


TATGGGTGTG 


TTAGGTATTG 


GTTTGCCCGA 


ATT AG AAG TC 


1320 


ACTTATTCTG 


GCTCTACTGC 


GTCTCATAGT 


GGAAAAGCTT 


ATAAATACGA 


CAACTTCCCC 


1380 


ATTGTATTGA 


AAAATTCTGG 


TGCTATCAAA 


AGCAACACAT 


ATTCTTTGTA 


TTTGAACGAC 


1440 


TCGGACGCTA 


TGCATGGCAC 


CATTTTGTTC 


GGAGCCGTGG 


ACCACAGTAA 


ATATACCGGC 


1500 


ACCTTATACA 


CAATCCCCAT 


CGTAAACACT 


CTGAGTGCTA 


GTGGATTTAG 


CTCTCCCATT 


1560 


CAATTTGATG 


TCACTATTAA 


TGGTATCGGT ATTAGTGATT CTGGGAGTAG 


TAACAAGACC 


1620 


TTGACTACCA 


CTAAAATACC 


TGCTTTGTCG 


GATTCCGGTA 


CTACTTTGAC 


TTATTTACCT 


1680 


CAAACAGTGG 


TAAGTATGAT 


CGCTACTGAA 


CTAGGTGCGC 


AATACTCTTC 


CAGGATAGGG 


1740 


TATTACGTAT 


TGGACTGTCC 


ATCTGATGAT 


AGTATGGAAA 


TAGTGTTCGA 


TTTTGGTGGT 


1800 


TTTCACATCA 


ATGCACCACT 


TTCGAGTTTT 


ATCTTGAGTA 


CTGGCACTAC 


ATGTCTTTTA 


1860 


GGTATTATCC 


CAACGAGTGA 


TGACACAGGT 


ACCATTTTGG 


GTGATTCATT 


TTTGACTAAC 


1920 
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GCGTACGTGG TTTATGATTT GGAGAATCTT GAAATATCCA TGGCACAAGC TCGCTATAAT 1980 

ACCACAAGCG AAAATATCGA AATTATCACA TCCTCTGTTC CAAGCGCCGT AAAGGCACCA 2040 

GGCTATACAA ACACTTGGTC CACAAGTGCA TCTATTGTTA CCGGTGGTAA CATATTTACT 2100 

GTAAATTCCT CACAAACTGC TTCCTTTAGC GGTAACCTGA CGACCAGTAC TGCATCCGCC 2160 

ACTTCTACAT CAAGTAAAAG AAATGTTGGT GATCATATAG TTCCATCTTT ACCCCTCACA 2220 

TTAATTTCTC TTCTTTTTGC ATTCATCTGA AAACCGTTGC ACAAAGTTTA GACATTCACA 2280 

TCTCCAAGCC AGTTGGAGTT TCTGGCGGAA ATCGTTGCTC TCGCTTGGGC AAAGTTTTTT 2340 

TTTATTATTA ATTTTTTATT GTTACGTTGG CGGTCTTTAT TTTTACTTCA CAATAGTTTA 2400 

TCTTACCCAC TAAGAATAGG TTACCATTTA TTCACATTTT TTTTTCTCAT TCCTAGTATA 2460 

CTATTTACCT GGGATATGGC CTATAATCAA AGGCTTTAAT ATTCTAATAA TTCGTTTGGC 2520 

ATCTAG 2526 
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CLAIMS 

1. A process for preparing albumin by secretion from a yeast genetically 
modified to produce and secrete the albumin, comprising culturing the 

5 yeast in a culture medium such that albumin is secreted into the 

medium, characterised in that the yeast cells have a reduced level of 
yeast aspartyl protease 3 proteolytic activity. 

2. A process according to Claim 1 wherein the said proteolytic activity 
10 is an endoprotease activity specific for monobasic sites and for paired 

basic amino acids in a polypeptide. 

3. A process according to Claim 1 or 2 wherein the yeast is 5. 
cerevisiae. 

15 

4. A process according to Claim 1, 2 or 3 wherein the yeast lacks a 
functional YAP3 gene or homologue thereof. 

5. A process according to any one of Claims 1 to 4 wherein the yeast 
20 cells additionally have a reduced level of S. cerevisiae Kex2p 

proteolytic activity. 

6. A process according to any one of the preceding claims wherein the 
albumin is a human albumin. 



25 



30 



A culture of yeast cells containing a polynucleotide sequence encoding 
an albumin and a second polynucleotide sequence encoding a secretion 
signal causing albumin expressed from the first polynucleotide 
sequence to be secreted from the yeast, characterised in that the yeast 
cells have a reduced level of yeast aspartyl protease 3 proteolytic 
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activity. 



10. 



10 



11. 



A culture according to Claim 7 wherein the albumin is a human 
albumin. 



A culture according to Claim 7 or 8 wherein the yeast is S. 

cerevisiae. 

A culture according to any one of Claims 7 to 9 wherein the said 
signal is cleaved by the yeast prior to release of the albumin from the 

yeast. 

A culture according to any one of Claims 7 to 10 wherein the yeast 
cells additionally have a reduced level of Kex2 P proteolytic activity. 

A culture according to Claim 11 wherein the said secretion signal is 
cleaved from the albumin by a protease other than Kex2p. 

A modified albumin having at least 90% sequence identity to a 
naturally-occurring albumin, which naturally-occurring albumin is 
susceptible to cleavage with yeast aspartyl protease 3 (Yap3p) when 
expressed and secreted in yeast, characterised in that the modified 
albumin is not susceptible to such cleavage. 

25 14. A modified albumin according to Claim 13 wherein the modified 
albumin lacks a monobasic amino acid present in the naturally- 
occurring albumin protein. 



15 



12. 



13. 



20 



30 



15. A modified albumin according to Claim 13 or 14 wherein the said 
monobasic amino acid is arginine. 
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16. A modified albumin according to Claim 14 or 15 wherein the 
modified albumin additionally lacks a pair of basic amino acids 
present in the naturally-occurring albumin. 

5 17. A modified albumin according to Claim 16 wherein the said pair of 
ammo acids is Lys, Lys; Lys, Arg; Arg, Lys; or Arg, Arg. 

18- A modified albumin according to Claim 13 wherein the naturally- 

occur^ng albumin is a human albumin and the modified protein lacks 
Arg ; and, optionally, residues 413 and 414 do not each consist of 
lysine or arginine. 

19- A modified albumin according to Claim 18 which is a human albumin 
having the amino acid changes R410A, K413Q, K414Q. 



20. 



A polynucleotide encodi„ s . modified a|bumi „ ^ 

of Claims 13 to 19. 



2'- A ^'»"^ning a polynucleotide accorii ng ,o Claim 20 

sisals such tna, lne modified aIbumi „ . e ^ 

*e yeas,, and a further polynucleotide adjacent the said 

polynucleotide such tha, the modified albutnin is secreted from ^ 

yeast. 
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Figure 2 
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Figure 3 

5' and 3' regions of YAP3 obtained by PCR: 
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Figure 4 
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. .Figure 6 



5* and 3* regions of KEX2 obtained by PCR: 
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